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Summary 


This  paper  gives  upper  and  louer  bounds  on  the  rank 
of  two  matrices,  and  discusses  their  connection  with  the 
that  rank  be  additive  over  a  set  of  matrices. 


r 


of  the  sunn 
condition 


1.  Introduction 


An  important  theorem  in  statistics,  due  to  Cochran, [2],  sayB 
that  if  are  symmetric  n  x  n  matrices  for  which 

(I)  A]+—+Am  =  I 

(II)  rank  (A^)+# • »+rank  (A^)  =  n, 
then  A^Aj  =  0,  i  ^  j. 

Various  versions  of  this  theorem  have  been  considered,  [ l] ,  [3]> 

and  [4].  Smiley  [5]  has  recently  offered  another  proof.  The  key  point  of 

the  theorem  seems  to  be  that  the  rank  of  tne  sum  is  the  sum  of  the  ranks, 

and  this  note  was  undertaken  with  the  intention  of  finding  necessary  and 

sufficient  conditions  that  r(A+B)  =  r(A)  +  r(B),  for  arbitrary  A  and 

B.  Searching  for  such  conditions  led  to  looking  for  sharp  upper  and  lower 

bounds  on  r(A+B),  and  there  did  not  seem  to  be  much  in  the  literature  on 

this,  other  than  the  trivial  r(A+B)  <  r(A)  +  r(B).  An  improved  upper 

bound  and  a  lower  bound,  which  seems  to  be  new,  were  found;  these  are  given 

in  Section  4>  Theorem  1.  Conditions  for  r(A+B)  “  r(A)  +  r(R),  (Theorem  2)., 

follow  immediately  from  Theurem  1,  so  that  Theorem  1  appears  to  be  more 

important  than  the  result  we  set  out  to  prove,  hence  the  title  of  this  note. 

* 

As  indicated  above,  we  hse  r(A)  to  designate  the  rank  of  the  matrix  A 
and  d(cf)  for  the  dimension  of  the  vector  space  f’f.  We  also  use  I  and  0 
indiscriminately  to  represent  the  identity  and  zero  matrix  of  whatever  size 
necessary  to  fit  the  other  matrices  in  a  particular  expression.  In  the 
usual  way,  for  two  vector  spaces  and  R0  we  let  be  the 

space  of  vectors  p^  +  p^  P]_€^  ^2e^2*  a-'-so  ma^e  extensive 

use  of  the  projector  of  a  vector  space.  The  properties  of  a  projector  are 
easily  established;  we  summarize  them  in  the  next  section* 
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2.  The  projector  of  a  space 

Given  a  vector  subspace  Of  of  the  space  of  lxn  vectors,  there  is  a  unique 
symmetric,  idempotent  n  x  n  matrix  A  satisfying 

(I)  aA  =  a  if  and  only  if  a  c  Of  • 

(II)  rank  of  A  =  dimension  of  Of, 

This  symmetric,  idempotent  matrix  is  called  the  projector  of  Of* 

If  a^,a2> are  an  orthonormal  basis  of  Of,  then  A  =  a{a]+,**'ta|at» 
or  if  A  is  a  matrix  whose  rows  are  a  basis  of  Of,  then  A  =  A,(AA')”^A. 

Any  1  x  n  vector  p  may  be  represented  uniquely  as  the  sum  of  a  vector  in 
Of  and  a  vector  in  the  orthogonal  complement  of  Of,  by  writing 
p  ~  pA  +  p(l  -  A),  The  projector  of  the  orthogonal  complement  of  Of 
is  I  -  A,  and  r(A)  +  r(l  -  A)  =  n. 

We  will  use  an  overscore  to  represent  the  projector — given  a  matrix 


B, 

the  notation 

B  will  mean  the  projector  of  the  row  space  of  B;  thus 

B 

is  the 

unique 

matrix  satisfying 

(I) 

B»  =  B 

(II) 

BB  =  B 

(III) 

B  =  TB 

for  some  T,  i.e.  the  rows  of  B  are  in  the  row 

space  of  B. 

(IV) 

BB  =  B 

(V) 

r(B)  = 

r(B) . 

The  projector  provides  a  convenient  method  for  representing  the 
rank  of  a  composite  matrix  as  a  sum — for  example,  we  write 


and  since  the  rows  of  AB  are  in  the  row  space  of  B,  they  may  be  removed 
by  elementary  row  operations;  thus 
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a  /Att-BK 

(1)  r(*)=r(  j  =  r(B)  ♦  r[A(I-B)], 

since  the  rows  cf  A(I-S)  are  orthogonal  to  the  rows  of  B. 

For  the  column  version  of  this  formula,  we  write 

r(R,S)  =  r(S)  +  r[(I-F)R] 

where  S"’’  is  the  projector  of  the  column  space  of  S,  or  the  projector 
of  the  row  space  of  S'* 


A  slightly  more  complicated  application  of  this  device  gives  this 
formula: 

(?)  r(“)  =  r(T)  +  r(S)  +  r[  (I-S^)R(I-T)  ] 

where  S'  is  the  projector  of  the  column  space  of  S  and  T  the  projector 
of  the  row  space  of  T.  To  prove  this,  we  write 

,RSv  ,RT+R(I-T)  Sx  _  /R(I-T)  Sv 
r<T0  =  r(  T  0)=r(  T  0) 


=  r(T)  +  r(R(I-T),S)  =  r(T)  +  r(S)  +  r[(I-S ' )R(I-T)] . 


3*  Bounds  for  the  rank  of  a  product 

We  need  the  results  of  this  section  to  prove  the  main  theorem.  We 
want  bounds  on  the  rank  of  ABj  to  get  them  we  write 

r(B>  =  r(®B) 

then  use  formula  (l)  to  get 


(3)  r(B)  =  r(AB)  +  r[B(I-AB)], 
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where  AB  is  the  projector  of  the  row  space  of  AB. 
projector  of  the  row  space  of  A,  then  IB(I-AB)  =  0, 

T,  5  =  TA,  Thus  (3)  may  be  written 

r(B)  =  r(AB)  +  r[ (I-I)B(I-AB)] , 

which  gives  these  bounds  on  r(AB), 

r(B)  -  r[ (I-I)B]  <  r(AB)  <  r(B), 

and  the  weaker  result, 

r(B)  -  r(I-S)  £  r(AB)  <  r(B). 

The  latter  is  known  as  Sylvester’s  law  of  nullity,  usually  written  as 

(4)  r(A)  +  r(B)  -  n  <  r(AB)  <  r(B) 

where  r(l-X)  =  n  -  r(J)  =  n  -  r(A),  assuming  that  A  is  p  x  n  and  B 
is  n  x  q. 

4.  Bounds  for  the  rank  of  a  sum 

This  theorem  gives  upper  and  lower  bounds  for  the  rank  of  a  sum. 

Theorem  1.  Let  A  and  B  be  two  matrices  of  the  same  size,  let  their 
row  spaces  be  and  their  column  spaces  C  and  Then 

(5)  r(A)  +  r(B)  -  d^  fl  «2)  -  n  6^)  <  r(A+B) 

and 

(6)  r(A+B)  <  r(A)  +  r(B)  -  maxfd^  D  ,  d(C^C]C^)} . 


Now  if  I  is  the 
since  for  some 
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We  first  prove  (6),  which  is  quite  easy,  then  (5),  which  is  not  so 
easy.  We  have 

r(A+B)  £  d(»1-Hl2)  =  d^)  +  d(R2)  -  dO^Hf^). 

Since  d(R^)  =  r(A)  and  d(R2)  =  r(B),  we  have 

r(A+B)  <  r(A)  +  r(B)  -  dfR^Rg), 

and  a  similar  argument  on  the  column  spaces  yields  (6), 

Using  the  fact  that  r(^)  =  dfR^+R^)  =  r(A)  +  r(B)  -  d(R^flR2) 
and  r(A,B)  =  d(C^+(L)  =  r(A)  +  r(B)  -  a(<?^  fl C^) ,  we  may  write  (5)  in 
the  form 

(7)  r(g)  +  r(A,B)  <  r(A+B)  +  r(A)  +  r(B). 

Now  none  of  the  five  ranks  in  (7)  is  changed  if  we  replace  A  and  B 
by  PAQ  and  PBQ  with  P  and  Q  nonsingular.  Thus  we  may  assume  A  and 
B  have  any  form  obtained  by  performing  identical  elementary  row  and 
column  operations  on  each  of  them.  We  may,  for  example,  assume  that  A 
and  B  have  this  form 


/In 

0 

°\ 

/R 

s 

0  \ 

(° 

0 

0 ) 

B  =1  T 

0 

0  ) 

\0 

0 

o  / 

\o 

0 

I J 

In  that  case,  we  have  r(g)  =  r(l^)  +  r(S)  +  r(l2),  and  r(A,B)  = 
r(l^)  +  r(T)  +  r(l2),  so  that  we  will  prove  (?)  if  we  can  prove  that 

(8)  r(  I  )  +  r(S)  +  r(T)  <  r(£  +  rC1^  jj), 

dropping  the  subscript  on  I,  assumed  to  be  n  x  n.  Using  formula  (2), 


we  write 
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r(£  q)  +  r(:*R  ®)  =  2r(S)  +  2r(T)  +  *•[ (I-S • ) (I+R) (I-T) ]  +  r[ (I-S •  )K (I-T) ] 
end  this  leads  to  the  inequality 

(9)  r(R  g)  +  r(X^R  g)  *  2r(S)  +  2r(T)  +  r[(I-F)(I-T)], 

since  r[(I-S^)R(I-T)]  =  r[ (I-S7) (-H) (I-T) ] .  Dsing  (4)  with  n  =  r(I), 
we  have 

r[  (I-S7)  (I-T)]  *  n  -  r(S)  +  n  -  r(T)  -  n 
and  putting  this  in  (9)  we  have 

r(£  »>  +  r i1**  q)  2  r(S)  +  r(T)  +  n. 

This  establishes  formula  (8),  and  hence  (7),  which  is  equivalent  to  (5); 
the  proof  of  Theorem  1  is  complete. 

5.  The  rank  of  the  sum  and  the  sum  of  the  ranks. 

Theorem  1  enables  us  to  characterize  the  condition  that  rank  be 
additive  for  a  pair  of  matrices: 

Theorem  2.  Let  A  and  B  be  two  matrices  of  the  same  size,  with 
row  spaces  R^,  and  column  spaces  C^t  C Then 

r(A+B)  =  r(A)  +  r(B) 

if«  and  only  if 

din^R^DR^)  =  dim(C^  (1  C =  0. 

The  proof  follows  immediately  from  (5)  and  (6). 


The  fact  that  rank  is  additive  for  a  set  of  matrices  is  a  strong 
condition.  We  will  develop  some  consequences  of  this  condition  in  a 


7 


few  conclusions  leading  up  to  Theorem  3#  which  is  a  sort  of  generalized 
Cochran’s  Theorem,  First,  we  point  out  that  if  rank  is  additive  on  a 
set  of  matrices,  it  is  additive  on  subsets: 

Lemma  1,  If  rank  is  additive  for  a  set  of  matrices: 

r(A1  +  A2  +  •••  +  Am)  =  r(Ax)  +  r(A2)  +  •••  +  r(An) , 

then  rank  is  additive  for  sets  of  matrices  formed  by  adding  distinct 

A's  from  that  set  —  for  example,  if  r(A^  +•••+  A^2)  =  r(A^)  +•••+  r(A^2), 

then 


r(A1+A5)+r(A3+Ag+A10)+r(A6+A9)  -  r^+A^+A^+Ag+A^+A^Ag) . 

To  give  the  gist  of  the  proof,  consider  the  example  of  the  theorem. 
Let  A  =  A^+A^,  B  =  A^+Ag+A^,  C  =  A^+A^,  and  let  D  be  the  sum  of  the 
matrices  in  A^,,.,,A^2  not  included  in  A,  B,  or  C,  that  Is, 

D  =  A  2+A^+A^+A^+A  ^  ^  •  Then 


r(A^)+,,#+r(A^2)  =  r(A+B+C+D)<r(A+B+C)+r(D)£r(A)+r(B)+r(C)+r(D)£r(A^)+##«+r(A12) 


Thus  all  inequalities  are  equalities,  and  r(A+B+C)  =  r(A)  +  r(B)  +  r(C). 
The  converse  of  this  lemma  is  not  true  —  rank  can  be  pairwise  additive 
but  yet  not  finitely  additive,  for  example,  for  these  three  positive 
semi -definite  matrices: 

0  0-  0  O'  0  «)' 

rank  is  pairwise  additive  but  not  additive  over  all  three. 


Lenna  2.  If  r(A+B)  -  r(A)  +  r(B)  and  if  A  and  3  each  commute  with 
their  sum,  S  =  A+B,  then  AB  =  BA  =  0, 
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Proof:  Since  A  commutes  with  S,  it  commutes  with  S  -  A  =  B. 
Since  the  row  space  of  AB  =  BA  is  in  both  the  row  space  of  A  and 
the  row  space  of  B,  Theorem  2  shows  that  AB  =  BA  =  0. 


We  are  now  able  to  give  this  generalized  version  of  Cochran’s 
Theorem  mentioned  in  the  Introduction.  We  find  that  symmetry  has  no 
essential  role  in  the  theorem,  and  Condition  II,  that  2A^=  I,  can  be 
replaced  by  the  condition  that  each  Aj  commutes  with  £A^. 

Theorem  3.  Let  be  square  matrices  for  which  rank  is 

additive. 


r(A1+,,«+Am)  =  r(A^)+#*»+r(Am), 
and  let  S  be  their  sum:  S  =  A,+***+A  .  Then 

-  -  l  m  - 

(10)  AjA  =0  for  i/j 

if  and  only  if  each  of  the  A’s  commutes  with  the  sum: 


(ll)  A^S  =  SA.  ,  i  =  1,2,.. .,m. 


Proof:  Condition  (10)  certainly  implies  (ll).  To  prove  that  (ll) 

implies  (10)  we  prove  that  A-^A^  =  A^A^  =  0.  Let  A  =  A^,  B  =  A  ,  and 

C  =  A^+«**+Am.  Using  Lemmas  1  and  2,  we  know  that  A(B-i-C)  =  (B+C)h  =  0, 
2 

and  hence  A  =  AS  =  SA.  Going  to  a  similarity  transformation  if 

G  0 

necessary,  we  may  assume  that  A  has  the  form  q),  with  G  non¬ 
singular.  Then  we  write  A  +  B  +  C  =  S  in  the  form 
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Since  A  =  AS  =  SA  and  G  is  non-singular,  It  follows  that  S 
must  have  the  form  f  T  ~  I  •  Thus 


(s  U 

/B1  Ba\  +  /°  1  °2\  =/°  0  \ 
\B3  V  Vc3  \)  V> 


Now  according  to  Lemma  1,  r(B+C)  =  r(B)  +  r(C),  and  it  follows  that 
the  rows  of  (B^  B^)  and  of  (C^  C^)  must  all  be  zero,  or  else  Theorem 
2  would  be  violated.  By  the  same  argument,  the  columns  of 


and 


0 


must  be  zero.  Thus  B  = 


-CD 


a 


and  AB  =  BA  =  0. 
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